1.0 INTRODUCTION

This  document  summarizes  work  performed  by  Delta  Information  Systems, 
Inc.  (DIS)  for  the  National  Communications  System  (NCS),  Office  of  Technology 
and  Standards.  This  office  is  responsible  for  the  management  of  the  Federal 
Telecommunications  Standards  Program,  which  develops  telecommunications 
standards whose use is mandatory for all Federal departments and agencies. The
purpose  of  this  project,  performed  under  contract  number  DCA100-83-C-0047, 
was  to  generate  a  new  set  of  standard  images  that  can  be  used  by  experimenters 
developing  bi-level  compression  algorithms. 

The  NCS  has  been  a  leader  in  the  development  and  promulgation  of 
standardized  imagery  for  facsimile.  The  NCS  sponsored  the  scanning  of 
documents  used  by  the  International  Telegraph  and  Telephone  Consultative 
Committee  (CCITT)  at  resolutions  of  200,  240,  300,  400  and  480  lines  per  inch 
and  stored  the  resultant  data  on  tape.  This  data  has  been  used  by  many 
experimenters  in  the  development  of  standard  compression  algorithms  for  digital 
facsimile.  This  work  contributed  significantly  to  the  development  of  high  resolution 
facsimile  standards  which  will  be  of  considerable  value  to  the  U.  S.  Government. 

At  the  present  time,  a  joint  International  Standards  Organization  (ISO)/CCITT 
Bi-level  Image  Group  (JBIG)  is  engaged  in  an  effort  to  define  a  new  progressive 
compression/decompression  algorithm  for  use  with  bi-level  images.  Critical  to  the 
evaluation  and  selection  process  needed  to  choose  from  a  number  of  candidate 
algorithm  submissions  is  a  set  of  digitized  test  images  against  which  the  various 
candidate  algorithms  can  be  applied.  The  purpose  of  compiling  a  standard  set  of 
bi-level  images  is  to  make  the  results  presented  by  various  experimenters  in  the 
field  directly  comparable,  without  regard  to  differences  in  image  content.  As  a 
result  of  this  project,  all  experimenters  engaged  in  performing  studies  for  the 
CCITT  regarding  bi-level  transmission  techniques  will  have  access  to  magnetic 
tapes  or  DOS  diskettes  containing  digitized  versions  of  the  standard  images.  In 
addition,  the  images  will  be  useful  in  evaluating  graphic  printer  quality  and 
capability. 

This  report  is  comprised  of  five  sections  plus  an  appendix.  Section  1 
provides  a  brief  description  of  the  objectives  of  the  study  and  an  outline  of  the 
contents  of  this  report.  Section  2  describes  the  selected  set  of  standard  bi-level 
images.  Section  3  describes  the  desired  bi-level  image  content  generated  primarily 
by participants in the JBIG work. Section 4 describes how the images were
assembled, and Section 5 is a summary of the work performed. The appendix is a
user's  guide  that  contains  copies  of  the  images,  some  statistical  information  on 
their  content,  and  the  magnetic  media  format  information  required  for  their  use. 

2.0  SELECTED  IMAGES 

Each  of  the  selected  images  is  described  below.  Each  image  is  categorized 
according  to  name,  dimensions,  pel  density,  and  image  content  category  number 
(CC#).  Image  content  categories  are  described  in  more  detail  in  the  next  section. 
Content  categories  (CC#'s)  are  defined  as  follows: 

CC#  Category  Description 

1  Representative  imagery  from  CCITT  documents 

2  Characters,  legibility  test 

3  Engineering  Drawing  /  Line  Art 

4  Halftones 

5  Test  Chart 

6  High  Resolution  Imagery 

The  Appendix  following  this  report  is  a  User's  Guide  that  includes  copies  of 
all  of  the  images  with  the  information  necessary  for  accessing  them  from  either 
magnetic  tape  or  DOS  diskette. 

2.1  Facsimile  Test  Chart  (Figure  A-1) 


Image#  Image Name            Dimension (W X H)  Dimension (W X H)  Pel Density  CC#
                              in bits            in inches          in bpi

1       Facsimile Test Chart  3500 X 4700        8.75 X 11.75       400          5,2,4

This  Facsimile  Test  Chart  is  being  progressed  as  a  U.  S.  standard  by  the 
Telecommunications  Industry  Association  TR-29  Committee.  It  is  also  being 
progressed  internationally  by  CCITT  SG  VIII.  The  complete  page  at  a  pel  density  of 
400  pels  per  inch  is  included  in  this  image  set.  The  patterns  that  make  up  the 
chart  are  described  below.  Note  that  Figure  A-1  has  been  reduced  to  fit  into  this 
report.

1. CCITT border of 4 scales with millimeter markings. The 5 and 10 mm
lines  are  extended  as  shown.  Top  and  bottom  scales  are  190  mm.  The  edge 
markings  from  the  bottom  extend  up  to  260  mm.  The  top  border  of  the  upper 
horizontal  line  is  279  mm  above  the  corresponding  bottom  line.  This  line  is  1/8 
inch  high  and  extends  from  edge  to  edge  of  the  chart  width  (8  3/4  inches).  The 
arrows  near  the  ends  of  this  line  are  8  1/2  inches  apart  and  centered  on  the  edge. 

2.  Scale  in  inches  across  the  top,  starting  from  0  in  the  middle  of  the  page 
with  .1  scale  markings.  The  border  at  the  left  side  of  the  chart  is  marked  in 
inches,  starting  at  the  top  of  the  chart. 

3.  Four  patterns  of  truncated  fan-type  multiple-line  pattern  with  low  taper 
rate.  The  larger  ones  are  calibrated  in  black  plus  white  lines  per  inch,  and  the 
smaller  ones  are  calibrated  in  microns. 

4.  Gurley  type  Pestrecov  Star  pattern  with  circles  of  50,  100  &  200  LPI. 

5.  Alternating  black  and  white  lines.  Upper  pattern  is  150  lines  per  inch, 
inclined  at  3  degrees  from  vertical.  The  lower  pattern  is  200  lines  per  inch, 
inclined  at  2  degrees  from  vertical.  The  angle  is  to  allow  the  lines  to  drift  through 
a  match  and  a  mismatch  with  the  photosensor  array  elements. 

6.  B/W  bar  patterns  of  100,  150,  200,  300,  400  and  600  LPI. 

7.  Isolated  black  and  white  lines.  The  vertical  pattern  is  inclined  at  5 
degrees  from  vertical. 

8.  NBS  type  resolution  pattern  calibrated  in  line  pairs  (black  plus  white)  per 
millimeter.  Smallest  patterns  are  near  the  center  of  the  chart. 

9.  Tapered  isolated  black  and  white  line  patterns  with  the  line  width 
calibrated  in  microns  or  inches. 

10.  B/W  bar  pattern  of  5  black  plus  white  bars  per  inch. 

11.  Parallel  lines  inclined  at  5  degrees  from  vertical. 

12.  NBS  type  resolution  pattern  calibrated  in  line  pairs  (black  plus  white)  per 
millimeter.  This  pattern  has  been  rearranged  to  place  the  smallest  patterns  near 
the  center  of  the  edge  of  the  chart. 

13. Diagonal line about 0.01 inch thick, for checking irregularities in
vertical pitch. Received lines with errors will show breaks or steps in this line.

14.  ISO  character  hexagonal  line  patterns  1,  2,  3,  &  4.  For  readability 
testing. 

15.  Halftone  dot  screens  of  10,  50  and  90%  black.  The  65  and  120  are  the 
number of dots per inch measured at a 45 degree angle.

16. Line crossing pattern. Pattern is about 3 inches long. Line thickness is
about 0.007 inches. The center to center line separation is 0.15 inches on the left
end and 0.05 inches on the right end. The number of scanning line crossings of
both lines multiplied by 10 is the vertical line pitch.

17.  Text  in  English,  Arabic,  Chinese,  Russian,  Spanish  and  French.  English 
text  is  in  12,  10,  8,  6,  4  &  2  point  sizes. 


2.2  Business  Letter  (Figure  A-2) 


Image#  Image Name       Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                         in bits            in inches          in pels per inch

2       Business Letter  3456 X 4416        8.6 X 11           400               1

DIS  prepared  the  Business  Letter  image  for  the  JBIG  evaluation.  This  image 
is  similar  to  the  previously  scanned  French  CCITT  document  #1.  It  is  included  in 
the  set  to  provide  a  baseline  for  comparing  new  algorithms  with  existing  ones  on 
this  class  of  imagery.  In  addition,  it  provides  another  set  of  data  to  that  already 
available  to  experimenters. 


2.3  Technical  Paper  (Figure  A-3) 


Image#  Image Name       Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                         in bits            in inches          in pels per inch

3       Technical Paper  3456 X 4416        8.6 X 11           400

The  Technical  Paper  image  was  also  used  by  JBIG  in  their  algorithm 
evaluation.  This  image  is  similar  to  the  previously  scanned  French  CCITT 
document  #5. 


2.4  Handwriting  (Figure  A-4) 


Image#  Image Name   Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                     in bits            in inches          in pels per inch

4       Handwriting  3072 X 4352        7.7 X 10.9         400               1

The  Handwriting  image  also  was  originally  part  of  the  JBIG  Evaluation  Set. 
This  image  is  representative  of  the  previously  scanned  French  CCITT  document  #2. 


2.5  Japanese  Newspaper  (Figure  A-5) 


Image#  Image Name      Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                        in bits            in inches          in pels per inch

5       Japanese News.  3072 X 4352        7.7 X 10.9         400               2

This document is an electronically scanned Japanese newspaper. This
image tests how effectively candidate compression algorithms
compress Kanji text. It also can be used to evaluate printer quality.


2.6  Mixed  Text  and  Screened  Halftones  (Figure  A-6) 


Image#  Image Name           Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                             in bits            in inches          in pels per inch

6       Mixed Text and       3680 X 3578        4.60 X 4.47        800               2,4,5,6
        Screened Halftones

This document consists of mixed text and screened halftones. The mixed
text portion of the document includes text at 3 different point sizes: 10 point, 6
point and 3 point. The screened halftone portion of the document employs a
photo-mechanical screening process and contains screened halftones at 2 different
screen sizes: a 65 line screen and a 150 line screen.


2.7 Legibility Chart (Figure A-7)


Image#  Image Name        Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                          in bits            in inches          in pels per inch

7       Legibility Chart  3456 X 6464        8.64 X 16.16       400               2

The  Legibility  Chart  image  was  also  used  by  JBIG  in  their  algorithm 
evaluation.  This  image  contains  random  alphanumeric  and  Kanji  characters  of 
various  point  sizes. 


2.8  Sailboat  #1  (Figure  A-8) 


Image#  Image Name   Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                     in bits            in inches          in pels per inch

8       Sailboat #1  3072 X 2048        7.7 X 5.1          400               4
        8x8 dither

The  four  sailboat  images  were  also  used  by  JBIG  in  their  algorithm 
evaluation.  They  were  all  computer  generated  by  different  techniques  to  produce 
half-tones  at  a  pel  density  of  400  pels  per  inch.  These  images  were  all  created 
from  the  same  original  (sailboat),  but  the  principal  difference  is  the  micro-structure 
of  the  dither  pattern  (i.e.  pitch  and  angle). 


2.9  Sailboat  #2  (Figure  A-9) 


Image#  Image Name   Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                     in bits            in inches          in pels per inch

9       Sailboat #2  3072 X 2048        7.7 X 5.1          400               4
        ERR dif


2.10  Sailboat  #3  (Figure  A-10) 


Image#  Image Name   Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                     in bits            in inches          in pels per inch

10      Sailboat #3  3072 X 2048        7.7 X 5.1          400               4
        4x4 dither

2.11 Sailboat #4 (Figure A-11)


Image#  Image Name   Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                     in bits            in inches          in pels per inch

11      Sailboat #4  3072 X 2048        7.7 X 5.1          400               4
        3x3 dither

2.12 Dithered Composite Image (Figure A-12)


Image#  Image Name          Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                            in bits            in inches          in pels per inch

12      Dithered Composite  1904 X 1488        9.52 X 7.44        200               4

This  dithered  composite  image  is  included  because  the  pel  density  of  200 
pels  per  inch  combined  with  additional  dithering  techniques  provides  variations  on 
the half-tone microstructure not provided by the sailboat images. Proceeding
clockwise from the upper left, the dither patterns are: Ordered 4x4x4,
Random dithering, Ordered 8x8 and Clump dithering.

Dither  coding  is  a  process  in  which  multilevel  gray  scale  images  are 
quantized,  or  thresholded,  to  1  bit/pixel.  The  8-bit  gray  level  of  each  input  pixel  is 
compared  to  a  threshold.  The  color  of  the  1  bit  output  pixel  is  dependent  upon 
whether  the  gray  level  value  of  the  input  pixel  is  greater  than  (black)  or 
less  than  (white)  the  dithered  threshold.  Ordered  dithering  employs  a  matrix  of 
fixed  thresholds  that  is  repeated  throughout  the  image.  The  ordered  matrices  used 
in this document are the 4x4x4 matrix and the 8x8 matrix. Random dithering
employs a pseudo-random number generator to vary the 8-bit gray level of each
input pel before it is compared to a fixed threshold to determine its binary color
(black or white). Clump dithering is an electronic approximation of the
photo-mechanical screening process. It employs an irregularly shaped matrix of fixed
thresholds. The thresholds are arranged so that a "dot" grows outward from the
center as successively darker gray levels are encountered in low contrast regions
of the image (an emulation of the photo-mechanical dot screen).
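
The ordered dithering process described above can be sketched in a few lines of Python. This is only an illustration: the 4x4 Bayer matrix and the function name are assumptions, since the report does not list the threshold matrices actually used. Following the text, a pel is marked black when its gray level is greater than the threshold.

```python
# Minimal ordered-dither sketch. BAYER_4X4 is an assumed stand-in; the
# report does not give the actual 4x4x4 or 8x8 threshold matrices.
BAYER_4X4 = [
    [0, 8, 2, 10],
    [12, 4, 14, 6],
    [3, 11, 1, 9],
    [15, 7, 13, 5],
]


def ordered_dither(gray, matrix=BAYER_4X4):
    """Threshold an 8-bit gray image (list of rows) to 1 bit per pel.

    The threshold matrix is tiled across the image. A pel becomes
    black (1) when its gray level exceeds the local threshold,
    otherwise white (0), as in the text's convention.
    """
    n = len(matrix)
    levels = n * n
    out = []
    for y, row in enumerate(gray):
        out_row = []
        for x, g in enumerate(row):
            # Map matrix entry k in 0..levels-1 to a threshold in 0..255.
            k = matrix[y % n][x % n]
            threshold = (2 * k + 1) * 256 // (2 * levels)
            out_row.append(1 if g > threshold else 0)
        out.append(out_row)
    return out


if __name__ == "__main__":
    flat_mid_gray = [[128] * 8 for _ in range(4)]
    for line in ordered_dither(flat_mid_gray):
        print("".join("#" if pel else "." for pel in line))
```

A flat mid-gray input yields a regular pattern with half of the pels black, which is the kind of repeating microstructure that distinguishes the dithered test images from one another.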


2.13  Computer  Generated  Engineering  Drawing  (Figure  A-13) 


Image#  Image Name           Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                             in bits            in inches          in pels per inch

13      Computer Generated   1952 X 2796        4.88 X 6.99        400               3
        Engineering Drawing

This  document  is  a  computer  generated  engineering  drawing  of  an  "A"  size 
schematic  rasterized  at  400  pels  per  inch.  This  document  is  representative  of 
computer-generated  drawings  created  by  computer  aided  design  (CAD)  and 
document  design  programs. 


2.14  Scanned  Engineering  Drawing  (Figure  A-14) 


Image#  Image Name           Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                             in bits            in inches          in pels per inch

14      Scanned Engineering  3456 X 4416        8.6 X 11           400               3
        Drawing

This  document  is  an  electronically  scanned  engineering  drawing  of  an  "A" 
size schematic.


2.15 House Design (Figure A-15)


Image#  Image Name    Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                      in bits            in inches          in pels per inch

15      House Design  3072 X 3040        7.68 X 7.6         400

This  simple  computer-generated  engineering  drawing  is  inverted;  i.e.  it  has 
white  lines  on  a  black  background.  This  image  is  included  to  test  compression 
algorithm  bias  towards  white  or  black.  For  example,  an  algorithm  might  assume  a 
white  background  to  increase  compression. 


2.16  Magazine  Text,  Halftone  (Figure  A-16) 


Image#  Image Name      Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                        in bits            in inches          in pels per inch

16      Magazine Text,  3456 X 4416        4.32 X 5.52        800
        Half-tone

This  document  combines  a  screened  halftone  image  and  an  electronically 
scanned  text  which  has  been  inverted.  Both  the  text  and  the  image  portion  of  the 
document  have  been  extracted  from  a  magazine. 


2.17 Magazine Page Composite (Figure A-17)


Image#  Image Name     Dimension (W X H)  Dimension (W X H)  Pel Density       CC#
                       in bits            in inches          in pels per inch

17      Magazine Page  3072 X 4352        7.7 X 10.9         400
        Composite

This  image  is  a  composite  of  electronically  scanned  segments  of  magazine 
pages. It includes a half-tone, text, and inverted text.


3.0 IMAGE CONTENT CATEGORIES

Through informal discussions with members of several organizations
concerned with image compression (e.g. JBIG, the NCS, the TR-29 committee of
the TIA), the image content that would be most effective in testing bi-level
compression techniques was defined. This content, listed below, was employed to
gather a collection of bi-level images.

The set of eight images collectively known as the "CCITT Standard Images"
was digitized by the French Administration at 200 pels per inch and recorded on
magnetic tape. Although these images were never an official standard, they have
been used extensively by experimenters. This image set lacks many features now
required. For example, the de facto set did not contain screened halftones,
electronically dithered images, computer generated images, or images at
resolutions higher than 200 pels per inch. Three images from the de facto set,
together with a legibility test chart, were digitized at resolutions up to 480 lines
per inch and made available on magnetic tape by the NCS [1] in 1982. Although
this NCS image set added higher resolutions and limited electronic dithering, it
does not fully meet current requirements. The new set of images described in this
report was designed to include these features.

3.1  Representative  "CCITT"  Imagery 

Imagery  similar  to  the  "CCITT  Standard  Images"  (scanned  at  200  pels  per 
inch)  is  included  so  that  comparisons  of  new  with  existing  compression  algorithms 
can  be  made  and  related  to  previous  work.  The  imagery  is  similar  to  the  "CCITT 
Standard  Images",  but  avoids  duplication  of  the  original  images,  which  are  still 
available.  The  three  selected  images  (numbers  2,  3,  and  4)  correspond  to  the 
"CCITT"  image  numbers  1,  5,  and  2  respectively.  These  represent  examples  of 
typical  business  use  of  facsimile. 


Image No.  Resembling "CCITT" No.  Name             Figure No.

2          1                       Business Letter  A-2

3          5                       Technical Paper  A-3

4          2                       Handwriting      A-4

3.2  Characters 

Subjective testing of bi-level image quality can benefit from the use of an
established technique that quantitatively measures the legibility of printed
characters. Legibility testing has the following advantages:

o  It is measurable with an objective, numerical performance number:
   percent legibility.

o  It bounds the quality question, in that the legibility of characters is a
   minimum requisite for many applications of bi-level image processing.

For these reasons, legibility of character data is useful in analyses of compression
algorithms of the non-image preserving type. The legibility document includes
English, Arabic, Chinese, Russian, Spanish, French and Kanji characters. Among
the types of characters, various parameters are exercised, including scanned and
computer generated characters, point size and font variations. Character data is
also useful in the evaluation of printers. The bi-level images illustrating
alphanumeric and character data are listed below.

Image No.  Figure No.  Image Name

1          A-1         Facsimile Test Chart

2          A-2         English letter

5          A-5         Japanese newspaper

6          A-6         Mixed text and screened half-tones

7          A-7         Legibility


3.3  Engineering  Drawings  /  Line  Art 

Manually  generated  engineering  drawings  are  typically  scanned  and  stored  in 
bi-level  formats  in  document  storage  and  retrieval  systems.  In  newer  systems, 
engineering  drawings  are  computer  generated,  and  either  stored  as  vector  graphic 
images  or  as  raster  graphic  images.  The  raster  graphic  images  have  similar 
properties,  whether  manually  generated  or  computer  generated.  There  are 
differences,  of  course,  due  to  the  noise  and  artifacts  generated  by  the  scanner 
used  for  manually  generated  drawings. 

An  image  that  is  mostly  black  can  be  used  to  test  an  algorithm  for  color 
bias.  That  is,  an  algorithm  may  assume  a  white  background  to  increase 
compression. Inverted engineering drawings (e.g. blueprints) test for such a case.
Inverted engineering drawings also can be used to test the ability of printers to
reproduce  thin  white  lines  on  a  black  background.  The  images  illustrating 
engineering  drawings  and  line  art  are  listed  below. 


Image No.  Figure No.  Image Name

13         A-13        Computer Generated Engineering Drawing

14         A-14        Scanned Engineering Drawing

15         A-15        Inverted House Design


3.4  Halftones 

Halftone images, both screened and dithered, are included in the standard
bi-level image set to exercise the ability of the compression algorithms to process
pictorial data. These halftone images can also be utilized to test the quality and
capabilities of printers to reproduce halftones. Halftone imagery presents a much
greater challenge to compression algorithms than printed characters or engineering
drawings.

Dithered halftones are electronically produced and represented on a pixel
basis; screened halftones are generated photographically and must then be
scanned. Screened halftone images are characterized by artifacts that could affect
compression algorithms. Depending on the type of dithering used, various regular
pixel patterns with differing pitches are produced. All halftones, whether dithered
or screened, exhibit very short black and white runs and low pel-to-pel correlation.
To fully exercise compression algorithms and test the capabilities of printers, both
types of halftones are included.
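
The remark about very short black and white runs can be made concrete with a small run-length measurement. The sketch below is illustrative only; the two sample scan lines are invented for the comparison and are not taken from the image set, and the function names are hypothetical.

```python
from itertools import groupby


def run_lengths(line):
    """Return the lengths of consecutive same-color runs in one scan line."""
    return [len(list(group)) for _, group in groupby(line)]


def mean_run_length(line):
    """Average run length; run-length coders favor large values."""
    return len(line) / len(run_lengths(line))


# Invented examples: a text-like line (long white runs, one short stroke)
# and a dither-like line (pels alternate almost every position).
text_like = [0] * 20 + [1] * 4 + [0] * 16
halftone_like = [0, 1] * 20

if __name__ == "__main__":
    print("text-like runs:    ", run_lengths(text_like))
    print("text-like mean:    ", mean_run_length(text_like))
    print("halftone-like mean:", mean_run_length(halftone_like))
```

A run-length based coder has little to exploit when the mean run length approaches one pel, which is one way to see why halftones are harder to compress than text or line art.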

The  selected  halftone  images  are  listed  below. 

Image No.  Figure No.  Image Name

6          A-6         Mixed text and screened half-tone

8          A-8         Sailboat 1

9          A-9         Sailboat 2

10         A-10        Sailboat 3

11         A-11        Sailboat 4

12         A-12        Dithered Composite

16         A-16        Magazine text, halftone

17         A-17        Magazine page composite


3.5  Test  Chart  Imagery 

In  order  to  more  fully  exercise  bi-level  compression  algorithms,  computer 
generated  information  in  the  form  of  resolution  targets  and  characters  with  various 
fonts is included in the standard set. Test chart imagery is also useful in
evaluating printer quality. The selected test chart imagery is listed below.


Image  No.  Figure  No.  Image  Name 

1  A-1  Facsimile  Test  Chart 

3.6  "Busy"  Imagery 

Highly  detailed  or  "busy"  imagery  is  required  to  fully  exercise  compression 
algorithms  and  test  printer  quality.  The  images  that  fulfill  this  requirement  are  listed 
below. 

Image  No.  Figure  No.  Image  Name 

5  A-5  Japanese  newspaper 

17  A-17  Magazine  page  composite 

Note  that  the  quality  and  characteristics  of  optically  scanned  images  are 
dependent  upon  the  characteristics  of  the  particular  scanner  being  used.  The 
quality  and  characteristics  of  computer  generated  images,  however,  can  be 
precisely  controlled.  To  more  fully  exercise  the  compression/decompression 
algorithms  and  further  evaluate  the  qualities  and  capabilities  of  printers,  both 
scanned  and  computer  generated  images  are  included. 

4.0  IMAGE  CONSTRUCTION 

The  final  set  of  images  generated  on  this  task  resulted  largely  from  our  work 
with the JBIG committee. As part of the JBIG algorithm testing, Delta assisted in
preparing  three  sets  of  images,  known  as  the  "Training  Set",  the  "Evaluation  Set" 
and  the  "Stockholm  Set".  Some  of  these  images  were  generated  by  Delta,  some 
were  contributed  by  committee  members,  and  some  were  pieced  together  from  the 
first  two  categories  by  Delta.  The  following  paragraphs  describe  this  process. 

Training  Image  Set 

The  JBIG  training  image  set  was  generated  for  those  participants  who  were 
developing  compression/decompression  algorithms.  This  set  was  produced  near 
the  beginning  of  the  JBIG  work,  and  was  meant  to  provide  representative  imagery 
to  test  and  train  candidate  algorithms,  before  any  official  algorithm  evaluation. 
Participants provided images to the JBIG committee, and the committee selected
the set to be used for testing. Delta compiled and distributed the image set,
providing  a  common  format  from  the  various  formats  received. 


Evaluation  Images 

The JBIG evaluation images ("Evaluation Set" and "Stockholm Set") were
generated  to  evaluate  candidate  compression  algorithm  performance.  Generic 
image  requirements  were  agreed  to  by  the  JBIG  group,  but  Delta  chose  the  actual 
imagery  to  be  included.  The  final  imagery  was  unknown  to  algorithm  participants 
until  after  all  candidate  compression/decompression  algorithms  were  frozen  (no 
further  algorithm  modifications). 

Bi-level  Image  Set  Construction 

The  construction  of  the  images  described  above  and  the  final  bi-level  image 
set  involved  several  steps.  These  steps  consisted  of  importing  the  image  into 
Delta's  computer  system  to  be  processed,  piecing  segments  of  images  together, 
complementing  (reversing  the  color  of  the  pixels)  images  or  complementing  just 
portions  of  images,  and  windowing  or  expanding  imagery  to  standard  sizes. 
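
The complementing and windowing or expanding steps listed above can be sketched as follows. This is a minimal illustration, not Delta's software: the image is assumed to be a list of rows of 0/1 pels (0 = white, 1 = black), and all function names are hypothetical.

```python
def complement(image):
    """Reverse the color of every pel (0 = white, 1 = black assumed)."""
    return [[1 - pel for pel in row] for row in image]


def complement_region(image, top, left, height, width):
    """Complement only a rectangular portion of the image."""
    out = [row[:] for row in image]
    for y in range(top, top + height):
        for x in range(left, left + width):
            out[y][x] = 1 - out[y][x]
    return out


def fit_to_size(image, width, height, fill=0):
    """Window (crop) or expand (pad with `fill`) to width x height pels."""
    rows = [row[:width] + [fill] * (width - len(row)) for row in image[:height]]
    rows += [[fill] * width for _ in range(height - len(rows))]
    return rows


if __name__ == "__main__":
    small = [[0, 1], [1, 0]]
    print(complement(small))
    print(complement_region(small, 0, 0, 1, 2))
    print(fit_to_size(small, 3, 3))
```

Padding with white (`fill=0`) is an assumption; an inverted image such as the House Design would more naturally be padded with black.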

The candidate bi-level images were received on both 1600 bits per inch
magnetic  tape  and  diskette  in  various  formats,  including  UNIX  TAR  format, 
unlabeled  format,  and  PC-DOS  format.  Computer  software  was  written  to  transfer 
the  images  from  their  respective  magnetic  media  to  Delta's  image  data  base. 

The  UNIX  TAR  format  bi-level  images  were  read  from  magnetic  tape.  The 
UNIX  TAR  format  is  composed  of  three  parts:  first,  a  header  describing  the 
structure of the image's width and length; second, a (possibly empty) set of
colormap  values;  and  third,  the  pixel  image,  stored  a  line  at  a  time,  in  increasing  y 
order.  The  UNIX  TAR  formatted  magnetic  tape  was  read  into  Delta's  computer  at 
128  words  per  record.  The  header  was  processed  to  determine  the  actual  record 
length.  Software  was  written  to  strip  the  header  from  the  image  and  change  the 
record  width  from  128  words  per  record  to  the  image's  actual  record  width. 
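
The header stripping and re-blocking step can be illustrated with a short sketch. It assumes a byte-oriented stream, a header length that has already been determined, and a known line width in bytes; the report gives neither the word size nor the header layout, so the function name and parameters here are hypothetical.

```python
def reblock(records, header_len, row_bytes):
    """Join fixed-size tape records, drop the header, and re-split the
    remaining data into rows of `row_bytes` bytes (one image line each).

    records    -- list of bytes objects as read from tape
    header_len -- header size in bytes (assumed known after parsing)
    row_bytes  -- actual record width of the image, in bytes
    """
    stream = b"".join(records)[header_len:]
    # Ignore any partial row left over from padding in the last record.
    usable = len(stream) - len(stream) % row_bytes
    return [stream[i:i + row_bytes] for i in range(0, usable, row_bytes)]


if __name__ == "__main__":
    # Two 8-byte records, a 4-byte header, and 3-byte image lines.
    tape = [bytes(range(8)), bytes(range(8, 16))]
    for row in reblock(tape, header_len=4, row_bytes=3):
        print(list(row))
```

For a 3456-pel line packed at 8 pels per byte, `row_bytes` would be 432.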

Bi-level  images  received  on  tape  in  unlabeled  format  (raw  data  with  no 
header information) were read into the computer and processed directly. The
bi-level images received in DOS format were read into an IBM compatible PC and
transported  to  the  computer  using  Kermit. 

Software was also written to merge sections of different images. For
example, the assembly of Image #16 required software to separate the halftone
information  from  the  bi-level  information.  The  bi-level  information  was  then 
complemented  and  the  image  was  reassembled. 

Delta  sent  documents  to  Image  Works  to  be  scanned  on  an  Eikonix  scanner. 
The  scanned  images  were  returned  in  compressed  run  encoded  format.  Delta 
prepared  software  to  convert  these  images  to  raster  format.  The  rasterized 
scanned  images  were  then  read  into  the  computer  and  processed. 
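
A conversion from run encoded data to raster format might look like the sketch below. The report does not describe the scanner's run encoded format, so the layout used here is an assumption: each line is a list of run lengths that alternate in color, starting with white. The function names are hypothetical.

```python
def decode_runs(runs, width):
    """Expand one run-encoded line into pels (0 = white, 1 = black).

    Runs are assumed to alternate in color starting with white; a
    leading run of length 0 lets a line begin with black.
    """
    line = []
    color = 0
    for length in runs:
        line.extend([color] * length)
        color = 1 - color
    if len(line) != width:
        raise ValueError(f"runs total {len(line)} pels, expected {width}")
    return line


def pack_line(line):
    """Pack pels into bytes, most significant bit first, zero padded."""
    padded = line + [0] * (-len(line) % 8)
    return bytes(
        int("".join(str(pel) for pel in padded[i:i + 8]), 2)
        for i in range(0, len(padded), 8)
    )


if __name__ == "__main__":
    pels = decode_runs([3, 2, 3], width=8)
    print(pels, pack_line(pels).hex())
```

Checking each decoded line against the expected width catches a corrupted run list early, before it shifts every following line of the raster.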

5.0  SUMMARY 

The selected set of bi-level images is representative of imagery typically
stored,  retrieved  or  transmitted  by  means  of  high  resolution  graphics  systems. 
Candidate  imagery  was  received  from  a  number  of  sources  associated  with  the 
JBIG  standardization  effort.  The  image  content  categories  deemed  most  effective 
in  testing  bi-level  image  compression/decompression  techniques  (listed  below) 
were  determined  and  used  as  a  basis  for  image  selection. 

o  Imagery similar to CCITT documents

o  Characters, legibility test

o  Engineering Drawing / Line Art

o  Halftones

o  Test Chart Targets

o  High Resolution

The 17 images selected for the final set are representative of the
categories listed above and have not been published previously in this form. Delta
has  volunteered  to  maintain  an  image  repository  (under  X3L3)  for  those 
participating  in  image  standards  work.  Delta  plans  eventually  to  contribute  this  set 
to  the  image  repository,  and  we  expect  that  new  images  will  be  added  from  time 
to  time  as  the  need  arises. 

1.  NCS-TIB-82-6, Users Manual for NCS Facsimile Test Document
Tapes, October 1982.

APPENDIX
USERS  GUIDE 


Table  A-1  contains  the  parameters  of  the  selected  images. 


Figure  Image Name                Image Dimensions  Image Dimensions  Pel Density
Number                            in inches         in pixels         dpi

A-1     Fax Test Chart            8.75 X 11.75      3500 X 4700       400

A-2     Business Letter           8.64 X 11.04      3456 X 4416       400

A-3     Technical Paper           8.64 X 11.04      3456 X 4416       400

A-4     Handwriting               7.68 X 10.88      3072 X 4352       400

A-5     Japanese News             7.68 X 10.88      3072 X 4352       400

A-6     Mixed Text and            4.60 X 4.47       3680 X 3578       800
        Screened Half-tones

A-7     Legibility Chart          8.64 X 16.16      3456 X 6464       400

A-8     Sailboat 1                7.68 X 5.12       3072 X 2048       400

A-9     Sailboat 2                7.68 X 5.12       3072 X 2048       400

A-10    Sailboat 3                7.68 X 5.12       3072 X 2048       400

A-11    Sailboat 4                7.68 X 5.12       3072 X 2048       400

A-12    Dithered Composite        9.52 X 7.44       1904 X 1488       200

A-13    Computer Generated        4.88 X 6.99       1952 X 2796       400
        Engineering Drawing

A-14    Scanned Engineering       9.12 X 11.60      3648 X 4640       400
        Drawing

A-15    House Design              7.68 X 7.6        3072 X 3040       400

A-16    Magazine Text,            4.32 X 5.52       3456 X 4416       800
        Half-Tone

A-17    Magazine Page             7.68 X 10.88      3072 X 4352       400
        Composite

Table  A-1:  Image  Parameters 
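
The inch dimensions in Table A-1 follow from the pixel dimensions and the pel density. The short sketch below shows the arithmetic, together with the uncompressed size of an image at 1 bit per pel. The function names are hypothetical, and padding each line to a whole byte is an assumption about storage rather than something stated in this guide.

```python
def size_in_inches(width_pels, height_pels, pels_per_inch):
    """Physical size implied by pixel dimensions and pel density."""
    return width_pels / pels_per_inch, height_pels / pels_per_inch


def raw_size_bytes(width_pels, height_pels):
    """Uncompressed size at 1 bit per pel, each line padded to a whole
    byte (the padding is an assumption, not taken from the guide)."""
    return ((width_pels + 7) // 8) * height_pels


if __name__ == "__main__":
    # Business Letter (Figure A-2): 3456 X 4416 pels at 400 pels per inch.
    w_in, h_in = size_in_inches(3456, 4416, 400)
    print(f"{w_in:.2f} X {h_in:.2f} inches")
    print(raw_size_bytes(3456, 4416), "bytes uncompressed")
```

The uncompressed byte count is the natural baseline when reporting compression ratios for candidate algorithms on these images.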


Figures A-1 through A-17 are copies of each of the images in the bi-level image
set.
FIGURE A-1


DELTA INFORMATION SYSTEMS, INC.

HORSHAM BUSINESS CENTER, BUILDING 3
300 WELSH ROAD

HORSHAM, PA 19044    TEL: (215) 657-5270

FAX: (215) [number illegible]


January 20, 1989


Dr. Yasuhiro Yamazaki
JBIG Chairman

Kamifukuoka R & D Laboratories
Kokusai Denshin Denwa Co., Ltd.
2-1-15, Ohara

Kamifukuoka-shi, Saitama 356
Japan


Dear Mr. Chairman,

Enclosed is a contribution submitted by Delta Information
Systems Inc. to the ISO Joint Bi-level Image Group (JBIG). This
contribution contains 11 (eleven) bi-level images to be used as
evaluation images for the JBIG algorithm comparison. The
algorithm comparison will be held starting February 15 in
Morristown, New Jersey. The algorithm comparison will be based
on several factors, including legibility, quality, and
compression.

The evaluation image characteristics match those that were
discussed in the JBIG meetings in London. I hope you will find
that everything is in order. If you have any questions, please
contact me by phone or FAX.


Sincerely,

John P. DiMaggio

Software Systems Analyst

JPD:mb

FIGURE A-2
Fig. 1. PSDN baseline network.


Fig. 2. Group 4 fax protocol structure.


not needed, since it is replaced by the first flag of the next
packet. To preserve the uniqueness of the flag, the transmitter
monitors the bits between the flags. If it detects five
consecutive ones, it automatically stuffs a zero to avoid six
consecutive ones. The receiver deletes a zero following five
consecutive ones to recover the original data. For random
data, the average number of stuffing bits is given by

[equation not reproduced]

where L is the number of bits between flags (L > N), and N
is the number of successive ones in the flag. Here N = 6, so
the average number of stuffing bits is L/62. For the full 128
byte data packet L = 8(128 + 8) = 1088, so 1088/62 ≈ 18
stuffing bits are used for each data packet. Since there are
500 000/(128 x 8) ≈ 489 packets in the transmission, the
stuffing bits add (18 x 489)/9600 = 0.92 s to the
transmission time.

Packets for which the FCS is in error are discarded by the
receiver, without notifying the transmitter. However, when
the next packet is received, it will not have the correct serial
number N(S). This causes the receiver to issue a REJ
command to the sender, and indicates the serial number of the
last packet received correctly. The transmitter must then go
back and retransmit all packets after the last one acknowledged,
in their correct order. A data packet contains 138
bytes, or 8 x 138 = 1104 bits, plus 18 stuffing bits, for a total
of 1122 bits. With the assumed bit error rate of 10^-5, the
probability of at least one error in a packet is approximately

1122 x 10^-5 = 0.01122.

Neglecting higher order levels of protocol, the total number
FIGURE A-17


Figures A-18 through A-34 are histograms of the black and white run lengths of
each of the images. Note that the number of occurrences is plotted individually for
each run length up to 63; beyond that, the occurrences are plotted in groups of 64 run
lengths. In other words, there are two scales on the horizontal axis; the numbers on the
right-hand part of the scale must be multiplied by 64 to determine the actual run length.
Note also that the vertical scale varies greatly from image to image, illustrating the
diverse content of the images.
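
The two-scale binning described above can be sketched in Python as follows. This is an illustration only, not the program used to produce the figures: the function names are hypothetical, and the assumption that group g covers run lengths 64g through 64g + 63 is inferred from the "multiply by 64" rule rather than stated in the text.

```python
from collections import Counter
from itertools import groupby


def run_lengths(row):
    """Yield (colour, length) for each run in a row of pels (0 = white, 1 = black)."""
    for colour, run in groupby(row):
        yield colour, sum(1 for _ in run)


def bin_label(length):
    """Map a run length to its histogram bin.

    Lengths up to 63 each get their own bin. Longer runs fall into
    groups of 64 lengths, labelled by length // 64 (assumed boundaries),
    so the label times 64 gives the start of the group.
    """
    if length < 64:
        return ("single", length)
    return ("x64", length // 64)


def run_length_histogram(rows):
    """Count white and black runs separately over all rows of an image."""
    hist = {0: Counter(), 1: Counter()}
    for row in rows:
        for colour, length in run_lengths(row):
            hist[colour][bin_label(length)] += 1
    return hist


# Two short rows: 70 white, 3 black, 5 white; then 128 black.
example = [[0] * 70 + [1] * 3 + [0] * 5, [1] * 128]
histogram = run_length_histogram(example)
print(histogram[0], histogram[1])
```

Keeping the white and black counts in separate counters mirrors the separate black and white histograms in the figures.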
(Each histogram plots the number of occurrences against run lengths and groups of
run lengths.)

Figure A-18. Fax Test Chart Histogram
Figure A-20. Technical Paper Histogram
Figure A-21. Handwriting Histogram
Figure A-22. Japanese News Histogram
Figure A-24. Legibility Chart Histogram
Figure A-25. Sailboat 1 Histogram
Figure A-26. Sailboat 2 Histogram
Figure A-27. Sailboat 3 Histogram
Figure A-28. Sailboat 4 Histogram
Figure A-29. Dithered Composite Histogram
Figure A-30. Computer Generated Engineering Drawing Histogram
Figure A-31. Scanned Engineering Drawing Histogram
Figure A-32. House Design Histogram
Figure A-33. Magazine Text, Half-Tone Histogram
Figure A-34. Magazine Page Composite Histogram
MAGNETIC MEDIA DATA FORMAT


9-Track  Magnetic  Tape 

The seventeen images are recorded sequentially on 1/2" 9-track 1600 bpi
magnetic tape in ANSI X3.27 format. Figure A-35 shows an example of the ANSI
standard tape format. The Volume Header Label (VOL1) identifies the tape. It is
followed, for each image data file recorded on the tape, by a group consisting of
Header Labels (HDR1 & HDR2), the image data, and Trailer Labels (EOF1 & EOF2).

The Volume Label contains volume identification, accessibility, and owner
identification and is the first record on the tape. The Volume Label is followed by image
files, which are delimited by header and trailer labels and end-of-file marks (TM). The
header labels contain the file identifier (file name), any file set information for the
image, generation data, and block length and record length information. The trailer labels
mirror the header labels, but also contain the tape block count for the image file. After
the last image file on the tape, the two end-of-file trailer labels (EOF1 & EOF2) are
replaced with end-of-volume labels (EOV1 & EOV2), followed by two file marks (TM)
denoting end of tape. The volume labels, header labels, and trailer labels are fully
defined in ANSI Standard X3.27.
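
As a rough sketch of the layout just described, the following Python function lists the sequence of labels, tape marks, and data sections for a given set of files. The function name and the "DATA(...)" placeholders are hypothetical. The placement of a tape mark after each header group, after the data, and after each trailer group follows the usual ANSI labeled-tape arrangement and is an assumption here, since the text states only that files are delimited by labels and tape marks.

```python
def tape_layout(file_names):
    """Return the expected order of labels, tape marks (TM) and data
    sections on the tape, as a list of strings.

    Assumed arrangement per file: HDR1 HDR2 TM data TM EOF1 EOF2 TM.
    For the last file, the EOF labels are replaced by EOV labels and
    a second TM marks the end of the tape, as described in the text.
    """
    items = ["VOL1"]  # the volume label is the first record on the tape
    last = len(file_names) - 1
    for index, name in enumerate(file_names):
        trailer = ["EOV1", "EOV2"] if index == last else ["EOF1", "EOF2"]
        items += ["HDR1", "HDR2", "TM", f"DATA({name})", "TM"] + trailer + ["TM"]
    if file_names:
        items.append("TM")  # second consecutive TM: end of tape
    return items


print(" ".join(tape_layout(["image01", "image02"])))
```

A reader program can compare the labels it encounters against such a list to detect a missing tape mark or a truncated tape.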

The image files were created by scanning the image from left to right and top to
bottom. The upper left pel of each image is represented by the most significant bit
(MSB) of the first byte of the image file. One (1) represents black, and zero (0)
represents white. The images are written to tape in blocks. The tape block size is a
compromise: large blocks use the tape more efficiently but require larger buffers. Also,
some computers cannot handle large tape blocks. Table A-2 contains the order of the
images and all the information necessary to extract and use the images from the tape.
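
The pel packing described above (MSB first, 1 = black, 0 = white, rows stored top to bottom) can be unpacked as in the following Python sketch. The function name and the `bytes_per_row` parameter are hypothetical. The default assumes that each scan line starts on a byte boundary, which the text does not state; the actual line length should be taken from Table A-2.

```python
def unpack_image(data, width, height, bytes_per_row=None):
    """Convert a packed bi-level byte stream into rows of 0/1 pels.

    The upper left pel is the most significant bit of the first byte;
    1 is black and 0 is white.
    """
    if bytes_per_row is None:
        # Assumption: rows are padded to a whole number of bytes.
        bytes_per_row = (width + 7) // 8
    if len(data) < bytes_per_row * height:
        raise ValueError("byte stream is shorter than width x height implies")
    rows = []
    for y in range(height):
        line = data[y * bytes_per_row:(y + 1) * bytes_per_row]
        # Bit 7 of each byte is the leftmost of its eight pels.
        row = [(line[x >> 3] >> (7 - (x & 7))) & 1 for x in range(width)]
        rows.append(row)
    return rows


# A 3 x 2 example: first row black-white-black, second row white-black-white.
pels = unpack_image(bytes([0b10100000, 0b01000000]), width=3, height=2)
print(pels)
```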

Floppy  Diskettes 

Seventeen images are recorded on the floppy diskette set. The floppy diskettes
are formatted for a 1.2 Megabyte high density diskette drive for use on an IBM or
compatible PC under the MS-DOS or PC-DOS operating systems. Images are stored one
image per diskette. The larger images that do not fit on one diskette are stored on
multiple diskettes in DOS Backup format. The format of the files on diskettes is similar
to that of magnetic tape. That is, the upper left pel of each image is represented by the
MSB of the first byte of the image file. One (1) represents black, and zero (0)
represents white. Again, since the image is stored on the diskette as a byte stream, the
parameters in Table A-2 must be used to retrieve the image.
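
To see which images need more than one diskette, the following Python sketch estimates the storage requirement from the pixel dimensions. It assumes that each image is stored uncompressed at one bit per pel, that the usable capacity of a DOS-formatted 1.2 Megabyte diskette is 1,213,952 bytes, and that the overhead of the DOS Backup format can be ignored. None of these figures is given in the text, and the function names are hypothetical.

```python
DISKETTE_BYTES = 1_213_952  # assumed usable capacity of a 1.2 MB DOS diskette


def raw_size_bytes(width_px, height_px):
    """Size in bytes of an uncompressed bi-level image at one bit per pel."""
    bits = width_px * height_px
    return (bits + 7) // 8  # round up to a whole byte


def diskettes_needed(width_px, height_px, capacity=DISKETTE_BYTES):
    """Number of diskettes needed, ignoring any backup-format overhead."""
    size = raw_size_bytes(width_px, height_px)
    return -(-size // capacity)  # ceiling division


print(raw_size_bytes(3072, 2048), diskettes_needed(3072, 2048))
print(raw_size_bytes(3500, 4700), diskettes_needed(3500, 4700))
```

Under these assumptions a 3072 x 2048 Sailboat image occupies 786,432 bytes and fits on one diskette, while the 3500 x 4700 Fax Test Chart occupies 2,056,250 bytes and would need two.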
Figure A-35. X3.27 Magnetic Tape Format

TABLE A-2. MAGNETIC TAPE FORMAT DATA

Feet of tape = 1672


